Reducing the Space Requirement of LZ-Index

نویسندگان

  • Diego Arroyuelo
  • Gonzalo Navarro
  • Kunihiko Sadakane
چکیده

The LZ-index is a compressed full-text self-index able to represent a text T1...u, over an alphabet of size σ = O(polylog(u)) and with k-th order empirical entropy Hk(T ), using 4uHk(T )+ o(u logσ) bits for any k = o(logσ u). It can report all the occ occurrences of a pattern P1...m in T in O(m3 logσ +(m + occ) log u) worst case time. Its main drawback is the factor 4 in its space complexity, which makes it larger than other state-of-the-art alternatives. In this paper we present two different approaches to reduce the space requirement of LZ-index. In both cases we achieve (2 + ε)uHk(T )+ o(u logσ) bits of space, for any constant ε > 0, and we simultaneously improve the search time to O(m2 logm + (m + occ) log u). Both indexes support displaying any subtext of length l in optimal O(l/ logσ u) time. In addition, we show how the space can be squeezed to (1+ ε)uHk(T )+o(u log σ) to obtain a structure with O(m2) average search time for m > 2logσ u.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Space-efficient construction of Lempel-Ziv compressed text indexes

A compressed full-text self-index is a data structure that replaces a text and in addition gives indexed access to it, while taking space proportional to the compressed text size. This is very important nowadays, since one can accommodate the index of very large texts entirely in main memory, avoiding the slower access to secondary storage. In particular, the LZ-index [G. Navarro, Journal of Di...

متن کامل

Space-Efficient Construction of LZ-Index

A compressed full-text self-index is a data structure that replaces a text and in addition gives indexed access to it, while taking space proportional to the compressed text size. The LZ-index, in particular, requires 4uHk(1 + o(1)) bits of space, where u is the text length in characters and Hk is its k-th order empirical entropy. Although in practice the LZ-index needs 1.0-1.5 times the text s...

متن کامل

پیچیدگی LZ سیستم های دینامیکی آشوبی و سیستم شبه تناوبی فیبوناچی

  The origin the concept of LZ compexity is in information science. Here we use this notion to characterize chaotic dynamical systems. We make contact with the usual characteristics of chaos, such as Lyapunov exponent and K-entropy. It is shown that for a two-dimensional system LZ complexity is as powerful as other characteristics. We also apply LZ complexity to the study of the quasiperiodic F...

متن کامل

Lozenge directly activates argos and klumpfuss to regulate programmed cell death.

We show that reducing the activity of the Drosophila Runx protein Lozenge (Lz) during pupal development causes a decrease in cell death in the eye. We identified Lz-binding sites in introns of argos (aos) and klumpfuss (klu) and demonstrate that these genes are directly activated targets of Lz. Loss of either aos or klu reduces cell death, suggesting that Lz promotes apoptosis at least in part ...

متن کامل

Water requirement of crops in Khuzestan province

One of the effective steps in use of water resources is accurate estimation of plants water requirement. Lack of accurate estimation of this amount leads to water loss and failure to achieve optimum yield, reduced potential of production, soil resources degradation by too much irrigation or lack of enough leaching and salinization of soils by irrigation less than necessary level, which would ul...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006